Motivated by the recent discovery of a quantum Chernoff theorem for asymptotic state discrimination,
we investigate the distinguishability of two bipartite mixed states under the constraint of
local operations and classical communication (LOCC), in the limit of many copies. While for two
pure states a result of Walgate et al. shows that LOCC is just as powerful as global measurements,
data hiding states (DiVincenzo et al.) show that locality can impose severe restrictions on the
distinguishability of even orthogonal states. Here we determine the optimal error probability and
measurement to discriminate many copies of particular data hiding states (extremal $d \times d$ Werner
states) by a linear programming approach. Surprisingly, the single-copy optimal measurement remains
optimal for $n$ copies, in the sense that the best strategy is measuring each copy separately,
followed by a simple classical decision rule. We also put a lower bound on the bias with which states
can be distinguished by separable operations.

I. INTRODUCTION 

The non-classical nature of information represented in states of a bipartite quantum system is
strikingly evident in the fact that, even allowing the experimenters (Alice and Bob) holding each
of the subsystems to use local operations and classical communication (LOCC) freely, they cannot
access the information as well as if they were in the same lab or could exchange quantum states.
Thus, there is a specifically quantum obstruction to the distributed analysis of data, and investigating
this obstruction is a way of obtaining an understanding of the quantum nature of information.

The problem of LOCC discrimination of two or more states has recently attracted considerable
attention [J H i, H [|, H, 0, H, ©, El El El El], and the least that can be said is that
it is difficult. In the simplest example, the experimenters are given one of two states at random
according to some probability distribution, and their task is to determine which state
they have with the smallest possible error probability. Throughout this paper we use
$P_{\mathrm{err}}^{X}(\rho_1,\rho_2;p)$ to denote the minimum error probability with which the states
$\rho_1$ and $\rho_2$, with prior probabilities $p$ and $1-p$ respectively, can be distinguished by a
POVM that can be implemented by operations in the class $X$. It will sometimes be convenient to refer
to the optimal bias (over random guessing) instead of the optimal error probability. This we define,
as usual, by

$$B^{X} = 1 - 2P_{\mathrm{err}}^{X}. \tag{1}$$

In this work we discuss the well-known classes of PPT-preserving (PPT) operations, separable
(SEP) operations [14] and local operations with classical communication (LOCC), which obey the
strict inclusions [15]

$$\mathrm{LOCC} \subset \mathrm{SEP} \subset \mathrm{PPT} \subset \mathrm{ALL}, \tag{2}$$

where ALL simply denotes the set of all possible global operations. Briefly, the POVMs which can
be implemented by operations in these different classes can be characterized as follows. An LOCC
POVM is one which can be implemented as a multi-round process where each round consists of a
partial measurement by one party, which can depend on previously generated classical messages, and
whose result is broadcast. A POVM is in SEP if and only if its elements can be written as positive
linear combinations of product operators. A POVM can be implemented by PPT operations if and
only if its constituent operators have positive partial transpose. The inclusion structure immediately
implies the ordering

$$P_{\mathrm{err}}^{\mathrm{LOCC}} \ge P_{\mathrm{err}}^{\mathrm{SEP}} \ge P_{\mathrm{err}}^{\mathrm{PPT}} \ge P_{\mathrm{err}}^{\mathrm{ALL}} = \frac{1}{2} - \frac{1}{2}\left\| p\rho_1 - (1-p)\rho_2 \right\|_1. \tag{3}$$

The final equality is the classic result of Helstrom and Holevo [16]. A similar closed-form expression
does not seem to exist for $P_{\mathrm{err}}^{\mathrm{LOCC}}$ or any of the other bipartite $P_{\mathrm{err}}^{X}$.

Motivated by the recent development of a quantum Chernoff theorem [TtJ, we are interested here
in the asymptotic behaviour of the quantity $P_{\mathrm{err}}^{X}(\rho_1^{\otimes n},\rho_2^{\otimes n};p)$
as the number of copies, $n$, goes to infinity. We define the Chernoff distance between the
states $\rho_1$ and $\rho_2$, with respect to a class of operations $X$, by

$$\xi^{X}(\rho_1,\rho_2) = \lim_{n\to\infty} -\frac{1}{n}\log P_{\mathrm{err}}^{X}(\rho_1^{\otimes n},\rho_2^{\otimes n};p). \tag{4}$$

(We note that the Chernoff distance is not strictly a distance, since it does not obey the triangle
inequality, and that it is independent of the prior probabilities as long as they are both non-zero.)

In [13], it was determined that the (unconstrained) quantum Chernoff distance
$\xi^{\mathrm{ALL}}(\rho_1,\rho_2)$ is given by the formula (note the independence of $p$):

$$\xi^{\mathrm{ALL}}(\rho_1,\rho_2) = -\min_{0\le s\le 1}\log \mathrm{Tr}\,\rho_1^{1-s}\rho_2^{s}. \tag{5}$$

This is a pleasantly straightforward generalisation of the classical Chernoff theorem for probability
distributions, where for probability distribution vectors $p$ and $q$

$$\xi(p,q) = -\min_{0\le s\le 1}\log\sum_{i=1}^{n} p_i^{1-s}q_i^{s}. \tag{6}$$
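The classical formula (6) is easy to evaluate numerically. The following is a minimal sketch, not taken from the paper: the function name `chernoff_distance` and the use of a ternary search are illustrative choices, and the code assumes that both distributions have strictly positive entries, so that the function of $s$ being minimised is continuous and convex on $[0,1]$.

```python
import math

def chernoff_distance(p, q, iterations=200):
    """Classical Chernoff distance -min_{0<=s<=1} log sum_i p_i^(1-s) q_i^s.

    Assumes p and q are probability vectors of equal length with strictly
    positive entries; the log-overlap is then convex in s, so a ternary
    search over [0, 1] converges to the minimum.
    """
    def log_overlap(s):
        return math.log(sum(pi ** (1 - s) * qi ** s for pi, qi in zip(p, q)))

    lo, hi = 0.0, 1.0
    for _ in range(iterations):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if log_overlap(m1) < log_overlap(m2):
            hi = m2  # the minimum lies in [lo, m2]
        else:
            lo = m1  # the minimum lies in [m1, hi]
    return -log_overlap((lo + hi) / 2)
```

For identical distributions the distance vanishes, and swapping the arguments leaves the value unchanged, which gives two quick sanity checks on any implementation.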

It is useful to define yet another Chernoff distance on quantum states, for an even more restricted
class of measurements than LOCC. Let $(M, 1-M)$ be the optimal single-copy LOCC POVM. Then
$\xi^{\mathrm{SC}}(\rho_1,\rho_2;p)$ is the classical Chernoff distance between the probability
distributions of the outcome of this measurement when it is performed on $\rho_1$ or $\rho_2$.
(Outside the bipartite setting this notion was considered before by Kargin [18].) If we write

$$p_{j1} = \mathrm{Tr}\,(M\rho_j), \qquad p_{j2} = \mathrm{Tr}\,((1-M)\rho_j), \tag{7}$$

we can summarize the relationships between the Chernoff distances we have defined as follows:

$$-\min_{0\le s\le 1}\log\sum_{i=1}^{2} p_{1i}^{1-s}p_{2i}^{s} = \xi^{\mathrm{SC}} \le \xi^{\mathrm{LOCC}} \le \xi^{\mathrm{SEP}} \le \xi^{\mathrm{PPT}} \le \xi^{\mathrm{ALL}} = -\min_{0\le s\le 1}\log\mathrm{Tr}\,\rho_1^{1-s}\rho_2^{s}. \tag{8}$$

Before proceeding with our main new results, we would like to make some general remarks about
these quantities and describe some of the existing knowledge about them. One striking difference
between global and local state discrimination can be seen in the effect of adding an ancilla. In
the global case, this has no effect on our ability to distinguish between states, asymptotically or
otherwise. That is, for any state $\tau$

$$P_{\mathrm{err}}^{\mathrm{ALL}}(\rho_1,\rho_2;p) = P_{\mathrm{err}}^{\mathrm{ALL}}(\rho_1\otimes\tau,\rho_2\otimes\tau;p), \qquad \xi^{\mathrm{ALL}}(\rho_1,\rho_2;p) = \xi^{\mathrm{ALL}}(\rho_1\otimes\tau,\rho_2\otimes\tau;p). \tag{9}$$

This is hardly surprising when one considers that the addition of any ancilla state is subsumed by
the POVM formalism in the global case. In cases where our ability to distinguish between two states
(of a $d \times d$ system, let's say) is worsened by restriction to LOCC, we will indeed be helped
by the provision of a $d \times d$ maximally entangled ancilla: by using it to teleport Alice's half to Bob
(say), we restore the ability to make global measurements and are able to decrease the
error probability accordingly. It is not always the case that the restriction to LOCC will impair our
performance, however. It was shown by Walgate et al. [1] (and generalized to non-orthogonal states
by Virmani et al. Q) that LOCC can do just as well in distinguishing between two pure states as
a global measurement can:

$$P_{\mathrm{err}}^{\mathrm{ALL}}(|\psi_1\rangle\langle\psi_1|, |\psi_2\rangle\langle\psi_2|; p) = P_{\mathrm{err}}^{\mathrm{LOCC}}(|\psi_1\rangle\langle\psi_1|, |\psi_2\rangle\langle\psi_2|; p). \tag{10}$$

Naturally, the corresponding Chernoff distances are also equal when both states are pure. Recently,
Nathanson [19] has generalized this to the case of discriminating a mixed state from a pure state.
He finds that under certain conditions on the fidelity of the states and the Schmidt coefficients of
the pure state, $\xi^{\mathrm{LOCC}}(\rho_1,\rho_2) = \xi^{\mathrm{ALL}}(\rho_1,\rho_2)$, even though the
single-copy error probabilities may differ.

From our perspective, it is more interesting to look at pairs of states where the LOCC constraint
reduces our ability to distinguish them. In this paper we discuss an example of such a case. Let
$\sigma_d$ and $\alpha_d$ denote the completely symmetric and completely anti-symmetric Werner states
in $d \times d$ dimensions, respectively (when $d$ is a power of two, these are the states used by
DiVincenzo et al. [20] for "data hiding"; see also [HI). In this paper we calculate the Chernoff
distance between these states, $\xi^{\mathrm{LOCC}}(\sigma_d,\alpha_d)$, and to do so we actually give an
expression for $P_{\mathrm{err}}^{\mathrm{LOCC}}(\sigma_d^{\otimes n},\alpha_d^{\otimes n};p)$.

The rest of this paper is organized as follows. In the next section we present an LOCC protocol
which puts an upper bound on $P_{\mathrm{err}}^{\mathrm{LOCC}}(\sigma_d^{\otimes n},\alpha_d^{\otimes n};p)$.
In section III we formulate the minimization of the error which can be achieved by PPT operations as a
linear program and, by solving the dual program, show that the LOCC upper bound is also a lower bound on
$P_{\mathrm{err}}^{\mathrm{PPT}}(\sigma_d^{\otimes n},\alpha_d^{\otimes n};p)$ and hence
on $P_{\mathrm{err}}^{\mathrm{LOCC}}(\sigma_d^{\otimes n},\alpha_d^{\otimes n};p)$, thus proving the
optimality of our LOCC protocol and allowing us to calculate the Chernoff distance. In section IV we
prove a lower bound on the bias $B^{\mathrm{SEP}}(\rho_1,\rho_2;p)$ in terms
of $B^{\mathrm{ALL}}(\rho_1,\rho_2;p)$, after which we conclude.

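To describe asymptotic behaviours we will use 'Big-O' notation (including $O$, $\Omega$ and $\sim$). If $X$ is
an operator on a bipartite Hilbert space $\mathcal{H}_A \otimes \mathcal{H}_B$, we use $X^{\Gamma}$ to denote
its partial transpose, which is defined (for some orthonormal product basis $\{|i\rangle_A \otimes |j\rangle_B\}$) by

$$\left(|i\rangle_A \otimes |j\rangle_B \langle k|_A \otimes \langle l|_B\right)^{\Gamma} = |i\rangle_A \otimes |l\rangle_B \langle k|_A \otimes \langle j|_B. \tag{11}$$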



II. LOCC DISCRIMINATION PROTOCOL 

Proposition 1. There is an LOCC protocol (requiring only one-way communication) which demonstrates
that $P_{\mathrm{err}}^{\mathrm{LOCC}}(\sigma_d^{\otimes n},\alpha_d^{\otimes n};p) \le \min\left(p\left(\frac{d-1}{d+1}\right)^{n}, 1-p\right)$.

Proof. Alice and Bob take each copy in turn and measure in the computational basis. They share 
their results. If they recorded different results for every copy then they guess that they have the 
anti-symmetric state. Otherwise, they have obtained the same result for at least one copy and they
know with certainty that they share the symmetric state.

For a single copy, the POVM implemented by this measurement is

$$\left\{ G_d = \sum_{i\neq j} |ij\rangle\langle ij|, \quad 1 - G_d = \sum_{i=0}^{d-1} |ii\rangle\langle ii| \right\}. \tag{12}$$

Because the states to be distinguished are both $U \otimes U$-invariant, it is convenient to apply the
twirl operation to the two operators in the POVM, which also emphasizes the symmetry of the
states that are to be distinguished. After doing so we have the following single-copy POVM of equal
performance:

$$\left\{ M_d = \frac{d-1}{d+1}\Pi_s + \Pi_a, \quad 1 - M_d = \frac{2}{d+1}\Pi_s \right\}, \tag{13}$$

where $\Pi_s$ and $\Pi_a$ are the projections onto the symmetric and anti-symmetric subspaces, respectively.
The POVM element $M_d$ corresponds to Alice and Bob having different measurement outcomes on
a single copy. For $n$ copies the POVM is

$$\left\{ M_d^{\otimes n}, \quad 1 - M_d^{\otimes n} \right\}, \tag{14}$$

since $M_d^{\otimes n}$ corresponds to Alice and Bob getting different outcomes for every copy they measure.
Let $\Lambda_k$ denote the sum of all elements of $\{\Pi_s, \Pi_a\}^{\otimes n}$ which have $k$ copies of $\Pi_a$.
Expanding in terms of the $n+1$ orthogonal projection operators $\{\Lambda_0, \ldots, \Lambda_n\}$, we find that

$$M_d^{\otimes n} = \sum_{k=0}^{n} \left(\frac{d-1}{d+1}\right)^{n-k} \Lambda_k. \tag{15}$$

The error probability of the protocol is

$$P_{\mathrm{err}} = p\,\mathrm{Tr}\left(M_d^{\otimes n}\sigma_d^{\otimes n}\right) + (1-p)\,\mathrm{Tr}\left(\left(1 - M_d^{\otimes n}\right)\alpha_d^{\otimes n}\right), \tag{16}$$

where the first term is the probability that Alice and Bob have the symmetric state and mistake it for
the anti-symmetric state, and the second term is the probability that they share the anti-symmetric
state and mistake it for the symmetric state.

Substituting (15) into (16) and using the fact that $\sigma_d^{\otimes n} \propto \Lambda_0$ and
$\alpha_d^{\otimes n} \propto \Lambda_n$, we obtain

$$P_{\mathrm{err}} = p\,\mathrm{Tr}\left(\left(\frac{d-1}{d+1}\right)^{n}\Lambda_0\,\sigma_d^{\otimes n}\right) + (1-p)\,\mathrm{Tr}\left((1 - \Lambda_n)\,\alpha_d^{\otimes n}\right) = p\left(\frac{d-1}{d+1}\right)^{n}. \tag{17}$$

If $P_{\mathrm{err}} > 1-p$ then we will do better to simply guess that we have the symmetric state all the time.
Adding this proviso to our strategy, we obtain the desired result. □
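The single-copy number behind this bound can be checked by direct counting. The sketch below is an illustration only, not part of the paper: the function names are hypothetical, and it uses just the facts that $\Pi_s = (1+F)/2$ has diagonal entries $\langle ij|\Pi_s|ij\rangle = (1+\delta_{ij})/2$ in the computational basis and that the symmetric Werner state is $\Pi_s$ divided by its trace. Exact rational arithmetic avoids rounding issues.

```python
from fractions import Fraction

def prob_outcomes_differ_symmetric(d):
    """Tr(G_d sigma_d): probability that Alice's and Bob's computational-basis
    outcomes differ on one copy of the symmetric Werner state."""
    # Diagonal entries <ij| Pi_s |ij> = (1 + delta_ij) / 2, since Pi_s = (1 + F) / 2.
    diag = {(i, j): Fraction(1 + (i == j), 2) for i in range(d) for j in range(d)}
    trace_pi_s = sum(diag.values())  # equals d (d + 1) / 2
    differ = sum(v for (i, j), v in diag.items() if i != j)
    return differ / trace_pi_s

def locc_error(d, n, p):
    """Error of the protocol, including the fallback of always guessing
    'symmetric' whenever that is better (p should be a Fraction)."""
    r = prob_outcomes_differ_symmetric(d)
    return min(p * r ** n, 1 - p)
```

The anti-symmetric state never yields equal outcomes, so only the symmetric state contributes to the error, which is why a single per-copy probability determines the whole $n$-copy expression.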

Remark 2. We note that the second term in the expression for the error probability is zero, meaning
that all the error is due to the case where the symmetric state is mistaken for the anti-symmetric
state. This is just what we would expect given that our protocol reports that we have a symmetric
state only when it is certain that we have one.

We shall now show that (17) is the optimum error probability that can be achieved using LOCC
by showing that it is the best that can be achieved even if we use the larger class of measurements
that can be implemented using PPT preserving operations.



III. OPTIMAL PPT PRESERVING POVM 

We shall first formulate the minimisation of the error probability over PPT preserving POVMs [lij
as a linear programming problem (see [22], for instance) by taking advantage of the symmetries of the
states we wish to distinguish. We will then show that there is a solution to the dual linear program
which gives a lower bound on the error probability exactly equal to the value achieved by the LOCC
procedure given above.

The states $\sigma_d^{\otimes n}$ and $\alpha_d^{\otimes n}$ are invariant under permutations of the copies and
under biunitary transformations of the individual copies. We can assume therefore that our two POVM elements
have the same symmetries (this is a trick that was used before in [24] to solve a relative entropy
minimisation problem). The most general operator with these symmetries is a linear combination
of the $n+1$ operators $\Lambda_k$ which we defined above, so we write our POVM as:



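$$\left\{ \sum_{k=0}^{n} x_k\Lambda_k, \quad \sum_{k=0}^{n} (1 - x_k)\Lambda_k \right\}. \tag{18}$$

The probability of error is given by

$$P_{\mathrm{err}} = p\,\mathrm{Tr}\left(\sum_{k=0}^{n} x_k\Lambda_k\,\sigma_d^{\otimes n}\right) + (1-p)\,\mathrm{Tr}\left(\sum_{k=0}^{n}(1 - x_k)\Lambda_k\,\alpha_d^{\otimes n}\right) = (1-p) + p\left(x_0 - \frac{1-p}{p}x_n\right). \tag{19}$$

The constraints

$$x_k \ge 0 \quad \text{for } k = 0, \ldots, n, \tag{20}$$
$$x_k \le 1 \quad \text{for } k = 0, \ldots, n \tag{21}$$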

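are necessary and sufficient to ensure that the two operators do in fact comprise a POVM.

The partial transpose of the flip operator $F$ is equal to $d\,\Phi_d$, where
$\Phi_d = \frac{1}{d}\sum_{i,j=0}^{d-1}|ii\rangle\langle jj|$ is the
maximally entangled state. Since $\Pi_s = (1 + F)/2$ and $\Pi_a = (1 - F)/2$, we have

$$\Pi_s^{\Gamma} = \frac{1}{2}\left(1 + d\,\Phi_d\right) = \frac{1}{2}\left((1 - \Phi_d) + (1 + d)\,\Phi_d\right), \tag{22}$$

$$\Pi_a^{\Gamma} = \frac{1}{2}\left(1 - d\,\Phi_d\right) = \frac{1}{2}\left((1 - \Phi_d) + (1 - d)\,\Phi_d\right), \tag{23}$$

so the operators $\Lambda_k^{\Gamma}$ can be written as linear combinations of operators from the set of $2^n$
orthogonal operators $\{(1 - \Phi_d), \Phi_d\}^{\otimes n}$.

Let $S_k^n$ denote the subset of strings in $\{0,1\}^n$ which have exactly $k$ ones; each string in $S_k^n$
labels one term of $\Lambda_k$, with the ones marking the tensor factors equal to $\Pi_a$. Then,

$$\Lambda_k^{\Gamma} = \frac{1}{2^n}\sum_{l=0}^{n}\ \sum_{0\le j\le l,k}\binom{n-l}{k-j}\binom{l}{j}(1-d)^{j}(1+d)^{l-j}\,T_l, \tag{24}$$

where $T_l$ is the sum over all elements of $\{(1 - \Phi_d), \Phi_d\}^{\otimes n}$ which have $l$ copies of $\Phi_d$.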

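A POVM is PPT preserving if and only if all of the operators that comprise it have positive
partial transpose [14]. A necessary and sufficient condition for the POVM to be PPT preserving is
therefore given by the following inequalities:

$$\sum_{k=0}^{n} x_k \sum_{0\le j\le l,k}\binom{n-l}{k-j}\binom{l}{j}(1-d)^{j}(1+d)^{l-j} \ge 0 \quad \text{for } l = 0, \ldots, n, \tag{25}$$

$$\sum_{k=0}^{n} (1 - x_k) \sum_{0\le j\le l,k}\binom{n-l}{k-j}\binom{l}{j}(1-d)^{j}(1+d)^{l-j} \ge 0 \quad \text{for } l = 0, \ldots, n. \tag{26}$$

Let $Q$ be an $(n+1) \times (n+1)$ matrix with elements

$$Q_{lk} = \sum_{0\le j\le l,k}\binom{n-l}{k-j}\binom{l}{j}(1-d)^{j}(1+d)^{l-j}. \tag{27}$$

We note that

$$\sum_{k=0}^{n} Q_{lk} = (1+d)^{l}\sum_{m=0}^{n-l}\binom{n-l}{m}\sum_{j=0}^{l}\binom{l}{j}\left(\frac{1-d}{1+d}\right)^{j} = (1+d)^{l}\left(1 + \frac{1-d}{1+d}\right)^{l}\sum_{m=0}^{n-l}\binom{n-l}{m} = (1+d)^{l}\left(\frac{2}{1+d}\right)^{l}2^{n-l} = 2^{n}. \tag{28}$$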
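The row-sum identity (28) can be confirmed for small cases with exact integer arithmetic. This is a minimal sketch rather than anything from the paper: the helper name `build_Q` is hypothetical, and it assumes the form of the matrix elements given in (27), with rows indexed by $l$ and columns by $k$.

```python
from math import comb

def build_Q(n, d):
    """Matrix with Q[l][k] = sum_j C(n-l, k-j) C(l, j) (1-d)^j (1+d)^(l-j),
    where j runs over 0 <= j <= min(l, k)."""
    return [[sum(comb(n - l, k - j) * comb(l, j) * (1 - d) ** j * (1 + d) ** (l - j)
                 for j in range(min(l, k) + 1))
             for k in range(n + 1)]
            for l in range(n + 1)]
```

For $n = 1$ and $d = 2$ the matrix has rows $(1, 1)$ and $(3, -1)$, each summing to $2^1$, in line with (28).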

Defining the vectors $c$ and $b$ as follows

$$c_i = \delta_{i0} - \frac{1-p}{p}\,\delta_{in}, \tag{29}$$

$$b_i = \begin{cases} 0 & \text{for } i = 0, \ldots, n, \\ -2^{n} & \text{for } i = n+1, \ldots, 2n+1, \\ -1 & \text{for } i = 2n+2, \ldots, 3n+2, \end{cases} \tag{30}$$

we can write the optimisation in standard linear programming form

$$\min\left\{ c^{T}\cdot x \;\middle|\; P\cdot x \ge b,\ x \ge 0 \right\}, \quad \text{where } P = \begin{pmatrix} Q \\ -Q \\ -\mathbb{1} \end{pmatrix}. \tag{31}$$

Writing (19) in terms of the objective function $c^{T}\cdot x$, we see that the POVM corresponding to the
vector $x$ has error probability

$$P_{\mathrm{err}}(x) = (1-p) + p\,c^{T}\cdot x. \tag{32}$$
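As a consistency check on this formulation, one can verify that the LOCC measurement of Section II corresponds to a feasible point of the linear program with the expected objective value. The sketch below is illustrative and not from the paper: the helper names are hypothetical, the coefficients $x_k = \left(\frac{d-1}{d+1}\right)^{n-k}$ are read off from the expansion of $M_d^{\otimes n}$ in the $\Lambda_k$, and the prior $p$ is assumed to be passed as a `Fraction` so that all arithmetic is exact.

```python
from fractions import Fraction
from math import comb

def build_Q(n, d):
    """Q[l][k] = sum_j C(n-l, k-j) C(l, j) (1-d)^j (1+d)^(l-j), 0 <= j <= min(l, k)."""
    return [[sum(comb(n - l, k - j) * comb(l, j) * (1 - d) ** j * (1 + d) ** (l - j)
                 for j in range(min(l, k) + 1))
             for k in range(n + 1)]
            for l in range(n + 1)]

def locc_point(n, d):
    """Coefficients x_k of the n-copy LOCC POVM element in the Lambda_k basis."""
    r = Fraction(d - 1, d + 1)
    return [r ** (n - k) for k in range(n + 1)]

def is_primal_feasible(x, n, d):
    """Check 0 <= x <= 1 together with 0 <= Q x <= 2^n (both elements PPT)."""
    Q = build_Q(n, d)
    Qx = [sum(Q[l][k] * x[k] for k in range(n + 1)) for l in range(n + 1)]
    return all(0 <= xk <= 1 for xk in x) and all(0 <= v <= 2 ** n for v in Qx)

def error_probability(x, p):
    """(1 - p) + p c.x with c_i = delta_{i0} - ((1 - p) / p) delta_{in}."""
    n = len(x) - 1
    return (1 - p) + p * (x[0] - (1 - p) / p * x[n])
```

Under these assumptions the point is feasible and its objective value reproduces $p\left(\frac{d-1}{d+1}\right)^{n}$, so any matching dual bound certifies optimality.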
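Proposition 3. The probability of error for a PPT preserving POVM to distinguish $\sigma_d^{\otimes n}$ and
$\alpha_d^{\otimes n}$ with prior probabilities $p$ and $1-p$,
$P_{\mathrm{err}}^{\mathrm{PPT}}(\sigma_d^{\otimes n},\alpha_d^{\otimes n};p)$, is bounded below by
$\min\left(p\left(\frac{d-1}{d+1}\right)^{n}, 1-p\right)$.

Proof. The linear program dual to (31) is just

$$\max\left\{ b^{T}\cdot y \;\middle|\; P^{T}\cdot y \le c,\ y \ge 0 \right\}. \tag{33}$$

Indeed, the duality of linear programs tells us that for any primal feasible point $x$ and any dual
feasible point $y$

$$c^{T}\cdot x \ge b^{T}\cdot y, \tag{34}$$

so any dual feasible point $y$ gives us a lower bound on the error probability:

$$P_{\mathrm{err}}^{\mathrm{PPT}}(\sigma_d^{\otimes n},\alpha_d^{\otimes n};p) \ge (1-p) + p\,b^{T}\cdot y. \tag{35}$$

It is convenient to write $y$ as the direct sum of three $(n+1)$-dimensional vectors, $y = u \oplus v \oplus w$, so
that we can rewrite the dual program as

$$\max\left\{ -2^{n}\sum_{i=0}^{n} v_i - \sum_{i=0}^{n} w_i \;\middle|\; u \ge 0,\ v \ge 0,\ w \ge 0,\ Q^{T}\cdot u - Q^{T}\cdot v - w \le c \right\}. \tag{36}$$

Consider the point $y^{*} = u^{*} \oplus v^{*} \oplus w^{*}$ defined by

$$u_i^{*} = \binom{n}{i}\frac{(d-1)^{n-i}\left((d+1)^{i} - (1-d)^{i}\right)}{(2d)^{n}(d+1)^{i}}, \tag{37}$$

$$v^{*} = 0, \tag{38}$$

$$w_i^{*} = \begin{cases} 0 & \text{for } i = 0, \ldots, n-1, \\ \max\left(\frac{1-p}{p} - \left(\frac{d-1}{d+1}\right)^{n}, 0\right) & \text{for } i = n. \end{cases} \tag{39}$$

We show that the point $y^{*}$ is dual feasible in Appendix A. The dual objective function at this
point is

$$-2^{n}\sum_{i=0}^{n} v_i^{*} - \sum_{i=0}^{n} w_i^{*} = -w_n^{*} = -\max\left(\frac{1-p}{p} - \left(\frac{d-1}{d+1}\right)^{n}, 0\right), \tag{40}$$

so, substituting $y^{*}$ into (35), we obtain the bound:

$$P_{\mathrm{err}}^{\mathrm{PPT}}(\sigma_d^{\otimes n},\alpha_d^{\otimes n};p) \ge \min\left(p\left(\frac{d-1}{d+1}\right)^{n}, 1-p\right). \tag{41}$$

□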

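Corollary 4. Substituting the results of Proposition 1 and Proposition 3 into (3), we have shown
that

$$P_{\mathrm{err}}^{\mathrm{PPT}}(\sigma_d^{\otimes n},\alpha_d^{\otimes n};p) = P_{\mathrm{err}}^{\mathrm{SEP}}(\sigma_d^{\otimes n},\alpha_d^{\otimes n};p) = P_{\mathrm{err}}^{\mathrm{LOCC}}(\sigma_d^{\otimes n},\alpha_d^{\otimes n};p) = \min\left(p\left(\frac{d-1}{d+1}\right)^{n}, 1-p\right). \tag{42}$$

Substituting into the definition of the Chernoff distance for each class of operations and noting
that each copy is measured separately in the optimal strategy, we obtain our main result:

Theorem 5. Whenever $0 < p < 1$, we have

$$\xi^{\mathrm{PPT}}(\sigma_d,\alpha_d) = \xi^{\mathrm{SEP}}(\sigma_d,\alpha_d) = \xi^{\mathrm{LOCC}}(\sigma_d,\alpha_d) = \xi^{\mathrm{SC}}(\sigma_d,\alpha_d) = \log\frac{d+1}{d-1}. \tag{43}$$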
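The convergence of the finite-$n$ rate to this value, and its independence of the prior, can be illustrated numerically. The following is a small sketch with hypothetical function names; it works with logarithms throughout, because $\left(\frac{d-1}{d+1}\right)^{n}$ underflows ordinary floating point for large $n$.

```python
import math

def log_locc_error(d, n, p):
    """Natural log of min(p ((d-1)/(d+1))^n, 1 - p), computed in the log domain."""
    return min(math.log(p) + n * math.log((d - 1) / (d + 1)), math.log(1 - p))

def finite_n_rate(d, n, p):
    """-(1/n) log P_err for n copies; tends to log((d+1)/(d-1)) as n grows."""
    return -log_locc_error(d, n, p) / n
```

The prior enters only through the term $-\log(p)/n$, which vanishes as $n \to \infty$; this is the sense in which the limit does not depend on $p$ for $0 < p < 1$.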
IV. A LOWER BOUND ON BIAS FOR SINGLE-COPY SEPARABLE MEASUREMENTS



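The fact that $\xi^{\mathrm{LOCC}}(\sigma_d,\alpha_d) = \xi^{\mathrm{SC}}(\sigma_d,\alpha_d)$ shows that our
ability to distinguish the extremal Werner states cannot be improved by measurements which are entangled
across copies. This is the least favorable many-copy behaviour possible. It would be interesting to know if
the single-copy error probability for these states also has the worst kind of scaling with dimension.
In terms of bias, setting $n = 1$ in (42) and noting that the two states are orthogonal
(so that $B^{\mathrm{ALL}} = 1$), we have shown that

$$\frac{B^{\mathrm{LOCC}}(\sigma_d,\alpha_d;p)}{B^{\mathrm{ALL}}(\sigma_d,\alpha_d;p)} = 1 - 2\min\left(p\,\frac{d-1}{d+1}, 1-p\right), \tag{44}$$

which equals $\frac{2}{d+1}$ for $p = \frac{1}{2}$.
Is $1/d$ an asymptotic lower bound whatever states we choose? If we relax the LOCC constraint and
allow separable operations then we can show that it is.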

Proposition 6. If $\rho_1$ and $\rho_2$ are bipartite states on a system of overall dimension $D$, then

$$B^{\mathrm{SEP}}(\rho_1,\rho_2;p) \ge \frac{1}{2\sqrt{D}}\,B^{\mathrm{ALL}}(\rho_1,\rho_2;p). \tag{45}$$

Proof. We know that the optimal error probability for global measurements is given by the
Holevo-Helstrom POVM, the elements of which are generally not even PPT. It was shown by Barnum and
Gurvits [23] that every Hermitian operator in the ball centred on the identity, with radius one in the
Hilbert-Schmidt norm, is separable. If we add to each element of the Holevo-Helstrom POVM the
minimum amount of the identity operator necessary to put the resulting operator inside this ball,
and normalize the POVM, we obtain the separable POVM

$$\left\{ \frac{1}{2}\left(1 + \frac{M}{\|M\|_2}\right), \quad \frac{1}{2}\left(1 - \frac{M}{\|M\|_2}\right) \right\}, \tag{46}$$

where $M$ is the projector onto the support of the positive part of $(1-p)\rho_2 - p\rho_1$ if $p < 1/2$ (and
minus one times the projector onto the support of the negative part otherwise). This POVM yields
the error probability

$$P_{\mathrm{err}} = \frac{1}{2}\left(1 - \frac{1}{2\|M\|_2}\left(|1 - 2p| + \|(1-p)\rho_2 - p\rho_1\|_1\right)\right). \tag{47}$$

Using the fact that $\|M\|_2 \le \sqrt{D}$, we get the bound

$$\|(1-p)\rho_2 - p\rho_1\|_1 = B^{\mathrm{ALL}} \ge B^{\mathrm{PPT}} \ge B^{\mathrm{SEP}} \ge \frac{1}{2\sqrt{D}}\|(1-p)\rho_2 - p\rho_1\|_1 = \frac{1}{2\sqrt{D}}\,B^{\mathrm{ALL}}. \tag{48}$$

□

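So, for states of a $d \times d$ system, $B^{\mathrm{SEP}}/B^{\mathrm{ALL}} \in \Omega(1/d)$. This result,
combined with our result for the data hiding states, leads us to the following conjecture.

Conjecture 7. For states on a $d \times d$ system,

$$\frac{B^{\mathrm{LOCC}}}{B^{\mathrm{ALL}}} \in \Omega(1/d). \tag{49}$$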

To put the insights and conjecture above into a different and wider perspective, let us look at the
biases $B^{X}$ for the particular value $p = \frac{1}{2}$:

$$B^{X}(\rho_1,\rho_2) := B^{X}\left(\rho_1,\rho_2;\tfrac{1}{2}\right), \tag{50}$$

which, by definition, is clearly symmetric: $B^{X}(\rho_1,\rho_2) = B^{X}(\rho_2,\rho_1)$. Furthermore, for
all the classes $X$ considered in the introduction, $B^{X}(\rho_1,\rho_2) = 0$ if and only if $\rho_1 = \rho_2$.
Indeed, the $B^{X}$ are all metrics, as they obey the triangle inequality:
$B^{X}(\rho_1,\rho_3) \le B^{X}(\rho_1,\rho_2) + B^{X}(\rho_2,\rho_3)$ for
any states $\rho_1$, $\rho_2$ and $\rho_3$. To be more precise, they derive from operator norms $\|\cdot\|_X$,
defined on trace-free Hermitian operators:

$$B^{X}(\rho_1,\rho_2) = \left\|\tfrac{1}{2}(\rho_1 - \rho_2)\right\|_X, \quad \text{with } \|M\|_X = \sup_{X\ \mathrm{POVM}\ (M_i)}\ \sum_i \left|\mathrm{Tr}\,M M_i\right|. \tag{51}$$

We note that the supremum in (51) is always attained by a POVM with two elements (one with
$\mathrm{Tr}\,(M M_1) > 0$ and the other with $\mathrm{Tr}\,(M M_2) = -\mathrm{Tr}\,(M M_1) < 0$).

For example, by Helstrom's theorem [16], $B^{\mathrm{ALL}}(\rho_1,\rho_2) = \left\|\tfrac{1}{2}(\rho_1 - \rho_2)\right\|_1$,
so $\|\cdot\|_{\mathrm{ALL}} = \|\cdot\|_1$.

Of course, all norms on finite-dimensional spaces are equivalent up to constant factors. Eq. (48)
translates into the ordering of norms

$$\|M\|_1 = \|M\|_{\mathrm{ALL}} \ge \|M\|_{\mathrm{PPT}} \ge \|M\|_{\mathrm{SEP}} \ge \frac{1}{2\sqrt{D}}\|M\|_{\mathrm{ALL}}, \tag{52}$$

and Conjecture 7 can be expressed as $\|M\|_{\mathrm{LOCC}} \ge \Omega(1/d)\,\|M\|_{\mathrm{ALL}}$ for $d \times d$
systems. Note that the existence of data hiding states implies that this would be essentially best possible,
as for $M = \tfrac{1}{2}(\sigma_d - \alpha_d)$,

$$\|M\|_{\mathrm{LOCC}} \le \|M\|_{\mathrm{SEP}} \le \|M\|_{\mathrm{PPT}} = \frac{2}{d+1}\|M\|_{\mathrm{ALL}}. \tag{53}$$

V. DISCUSSION 

We have calculated the Chernoff distance between the extremal d x d Werner states, under the 
constraint of LOCC operations, for all values of d. This is the first time the LOCC Chernoff 
distance has been calculated for states where it differs from the unconstrained Chernoff distance. 
In this case, we have also been able to calculate the smallest error probability that can be achieved 
by LOCC for any finite number of copies. The solution has at least two remarkable features: First, 
the error probability is - up to constant factors - equal to the n-th power of the single-copy error 
probability, showing that in a sense n copies don't give disproportionate advantage over one copy, 
in this case. Secondly, even the optimal n-copy measurement reflects this structurally; namely, it 
can be implemented by measuring the single-copy optimal POVM n times, followed by a trivial 
classical post-processing. As discussed in the introduction, this is a "worst-case" strategy for many 
copies. Both of these properties distinguish the solution from what is to be expected in the quantum 
Chernoff problem: e.g., discriminating two (non-orthogonal) pure states has a very simple optimal 
strategy, but for n copies (which is also a problem of discriminating two pure states) this strategy 
is highly collective over the n systems. Also, in general, even classically, the error probability shows 
only an asymptotically exponential decay, but here it is exactly exponential. 

Our result also leads to a number of further questions. An extension of the work which we are 
currently considering is to see if we can find Chernoff bounds for the discrimination of pairs of 
general Werner states. Preliminary and ongoing investigations suggest that some interesting effects 
occur when at least one state is non-extremal. Also, as discussed above, it would be interesting to
know how close to "worst possible" our example is in terms of comparing LOCC to unrestricted
measurements. That is, we would like to resolve our Conjecture 7 on the single-copy LOCC bias.

APPENDIX A: PROOF OF DUAL FEASIBILITY



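We note that $u_k^{*} \ge 0$ for $k = 0, \ldots, n$:

$$u_k^{*} = \binom{n}{k}\left(\frac{d-1}{2d}\right)^{n}\frac{(d+1)^{k} - (1-d)^{k}}{(d-1)^{k}(d+1)^{k}} \ge 0. \tag{A1}$$

It is obvious that $v^{*} \ge 0$ and $w^{*} \ge 0$, so the first three inequalities of (36) are satisfied.
We now show that the remaining inequality,

$$Q^{T}\cdot u^{*} - Q^{T}\cdot v^{*} - w^{*} \le c, \tag{A2}$$

is also satisfied. From (27) and (37),

$$\left(Q^{T}\cdot u^{*}\right)_k = \sum_{l=0}^{n} Q_{lk}\,u_l^{*} = s_1(d,n;k) - s_2(d,n;k), \tag{A3}$$

where

$$s_1(d,n;k) = \frac{1}{(2d)^{n}}\sum_{l=0}^{n}\ \sum_{0\le j\le l,k}\binom{n}{l}\binom{n-l}{k-j}\binom{l}{j}(d-1)^{n-l}(1-d)^{j}(1+d)^{l-j}, \tag{A4}$$

$$s_2(d,n;k) = \frac{1}{(2d)^{n}}\sum_{l=0}^{n}\ \sum_{0\le j\le l,k}\binom{n}{l}\binom{n-l}{k-j}\binom{l}{j}(d-1)^{n-l}\frac{(1-d)^{l+j}}{(1+d)^{j}}. \tag{A5}$$

Defining $m = l - j$ we can rewrite the first sum (A4) as

$$s_1(d,n;k) = \frac{(d-1)^{n}}{(2d)^{n}}\sum_{m=0}^{n-k}\sum_{j=0}^{k}\frac{n!}{(k-j)!\,(n-(m+k))!\,m!\,j!}\,(-1)^{j}\left(\frac{d+1}{d-1}\right)^{m}$$

$$= \frac{(d-1)^{n}}{(2d)^{n}}\sum_{m=0}^{n-k}\sum_{j=0}^{k}\frac{n!}{(n-m)!\,m!}\,\frac{(n-m)!}{((n-m)-k)!\,k!}\,\frac{k!}{(k-j)!\,j!}\,(-1)^{j}\left(\frac{d+1}{d-1}\right)^{m}$$

$$= \frac{(d-1)^{n}}{(2d)^{n}}\sum_{m=0}^{n-k}\binom{n}{m}\binom{n-m}{k}\left(\frac{d+1}{d-1}\right)^{m}\sum_{j=0}^{k}\binom{k}{j}(-1)^{j}.$$

The sum over $j$ is zero except when $k = 0$, so

$$s_1(d,n;k) = \delta_{0k}\,\frac{(d-1)^{n}}{(2d)^{n}}\sum_{m=0}^{n}\binom{n}{m}\left(\frac{d+1}{d-1}\right)^{m} = \delta_{0k}\,\frac{(d-1)^{n}}{(2d)^{n}}\left(1 + \frac{d+1}{d-1}\right)^{n} = \delta_{0k}.$$

Making the same change of variables ($m = l - j$) in (A5), we obtain

$$s_2(d,n;k) = \frac{(d-1)^{n}}{(2d)^{n}}\sum_{m=0}^{n-k}\sum_{j=0}^{k}\frac{n!}{(k-j)!\,(n-(m+k))!\,m!\,j!}\,(-1)^{m}\left(\frac{d-1}{d+1}\right)^{j}$$

$$= \frac{(d-1)^{n}}{(2d)^{n}}\sum_{m=0}^{n-k}\sum_{j=0}^{k}\frac{n!}{(n-k)!\,k!}\,\frac{(n-k)!}{((n-k)-m)!\,m!}\,\frac{k!}{(k-j)!\,j!}\,(-1)^{m}\left(\frac{d-1}{d+1}\right)^{j}$$

$$= \frac{(d-1)^{n}}{(2d)^{n}}\binom{n}{k}\sum_{m=0}^{n-k}\binom{n-k}{m}(-1)^{m}\sum_{j=0}^{k}\binom{k}{j}\left(\frac{d-1}{d+1}\right)^{j}.$$

The sum over $m$ is zero except when $k = n$, so

$$s_2(d,n;k) = \delta_{nk}\,\frac{(d-1)^{n}}{(2d)^{n}}\left(1 + \frac{d-1}{d+1}\right)^{n} = \delta_{nk}\left(\frac{d-1}{d+1}\right)^{n},$$

so the constraint (A2) is satisfied:

$$\left(Q^{T}\cdot u^{*} - Q^{T}\cdot v^{*} - w^{*}\right)_k = \delta_{0k} - \delta_{nk}\left(\frac{d-1}{d+1}\right)^{n} - \delta_{nk}\max\left(\frac{1-p}{p} - \left(\frac{d-1}{d+1}\right)^{n}, 0\right) \le \delta_{0k} - \delta_{nk}\,\frac{1-p}{p} = c_k. \tag{A9}$$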
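The feasibility argument can also be checked mechanically for small $n$ and $d$. The sketch below is an illustration under stated assumptions, not the authors' code: the helper names are hypothetical, the matrix elements and the dual point are taken in the forms of (27), (37) and (39), $v^{*} = 0$ is dropped from the computation, and $p$ is assumed to be a `Fraction` so that every comparison is exact.

```python
from fractions import Fraction
from math import comb

def build_Q(n, d):
    """Q[l][k] = sum_j C(n-l, k-j) C(l, j) (1-d)^j (1+d)^(l-j), 0 <= j <= min(l, k)."""
    return [[sum(comb(n - l, k - j) * comb(l, j) * (1 - d) ** j * (1 + d) ** (l - j)
                 for j in range(min(l, k) + 1))
             for k in range(n + 1)]
            for l in range(n + 1)]

def dual_point(n, d, p):
    """The vectors u* and w* of the candidate dual solution (v* is zero)."""
    r = Fraction(d - 1, d + 1)
    u = [Fraction(comb(n, i) * (d - 1) ** (n - i) * ((d + 1) ** i - (1 - d) ** i),
                  (2 * d) ** n * (d + 1) ** i)
         for i in range(n + 1)]
    w = [Fraction(0)] * n + [max((1 - p) / p - r ** n, Fraction(0))]
    return u, w

def check_dual(n, d, p):
    """Return (feasible, bound): whether Q^T u* - w* <= c with u*, w* >= 0,
    and the resulting lower bound (1 - p) + p b.y* on the error probability."""
    Q = build_Q(n, d)
    u, w = dual_point(n, d, p)
    c = [Fraction(0)] * (n + 1)
    c[0] += 1
    c[n] -= (1 - p) / p
    lhs = [sum(Q[l][k] * u[l] for l in range(n + 1)) - w[k] for k in range(n + 1)]
    feasible = all(val >= 0 for val in u + w) and all(a <= b for a, b in zip(lhs, c))
    bound = (1 - p) - p * sum(w)
    return feasible, bound
```

Exact rationals are used deliberately here: the constraint is tight in most components, so a floating-point comparison could report spurious violations.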



